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Abstract — Presented here is a Predictor-Based Model 
Reference Adaptive Control (PMRAC) architecture for a 
generic transport aircraft. At its core, this architecture features 
a three-axis, non-linear, dynamic-inversion controller. 
Command inputs for this baseline controller are provided by 
pilot roll-rate, pitch-rate, and sideslip commands. This paper 
will first thoroughly present the baseline controller followed by 
a description of the PMRAC adaptive augmentation to this 
control system. Results are presented via a full-scale, non- 
linear simulation of NASA’s Generic Transport Model (GTM). 

I. INTRODUCTION 

T he Integrated Resilient Aircraft Control project (IRAC) 
is a part of the NASA Aviation Safety program. A key 
focus of this project is to investigate adaptive control 
systems as a risk-mitigating technology for off-nominal 
conditions. This paper, based upon the research presented in 
[5], proposes a PMRAC adaptive control architecture as just 
such a candidate technology. 

To begin, section II presents the baseline control 
architecture and justifies the design selection. Section III 
then presents the PMRAC adaptive augmentation to this 
system. Here some of the fundamental stability proofs for 
the architecture are shown, but the interested reader is 
referred to specific publications for further detail. Section IV 
presents results from a full non-linear simulation of the 
control system, both with, and without the PMRAC adaptive 
augmentation. Finally, section V presents a brief summation 
of this paper’s contribution to the community. 

The primary intentions of this paper are two fold: to 
present the PMRAC architecture for a complete nonlinear 
dynamic inversion baseline controller (a novel contribution 
to the community in as far as the authors are aware) and to 
demonstrate the PMRAC architecture on a non-linear flight 
simulation. Future work will be aimed at more thoroughly 
assessing the strengths and weaknesses of this control 
strategy. 
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II. Baseline Controller: Nonlinear Dynamic 
Inversion 

A. Control Selection Justification 

The baseline controller for this work is a full non-linear 
dynamic inversion controller based in large part on [1], [2], 
and [3]. This control architecture is chosen for several key 
reasons. First, this control approach can be effectively 
implemented as evinced by the recent selection of a dynamic 
inversion controller for the F-35 aircraft. Second, and more 
importantly, a dynamic inversion controller offers a cost and 
time effective way to develop a control system; with 
appropriate modeling, a full-flight control system can be 
quickly and efficiently developed for research studies, as 
contrasted with a more time intensive traditional gain- 
scheduled controller. This ultimately facilitates the rapid 
evaluation and testing of multiple adaptive control systems 
over a large range of flight conditions. 

B. Control Architecture 

The general (and well known) rigid body dynamics for an 
aircraft are presented below: 

r = Id ) + oo x la ) . (1) 

Here, co is G $R 3X1 and represents a vector of the roll (p ), 
pitch ( q ), and yaw (r) rates. By modeling the torques on the 
aircraft using traditional aerodynamic stability derivatives 
(including p , q , and r derivatives), (1) may be decomposed 
into the following convenient form: 

co = A(t)co + G(t)z + B(t)u (2) 

For this decomposition, the matrices A (E 9t 3X3 ), G (E 
9t 3X7 ), and B (E 9t 3X3 ) are time-varying (from this point 
forward the time dependence of these variables will not be 
expressly shown). Moreover, the aircraft’s control allocation 
tables are incorporated into (1) such that the control vector u 
is E £R 3X1 and represents the three, non-dimensional lateral, 
longitudinal, and directional control signals (limited from -1 
to 1). The vector z represents a non-linear combination of co, 
specifically pq , qr , pr , {p 2 - q 2 ), (r 2 - p 2 ), and ( q 2 - r 2 ), as well 
as a bias term (which accounts for the dependence of x on 
slowly changing variables, such as angle of attack and side- 
slip). 



A non-linear dynamic inversion control law is then: 


u = B(t) 1 (a>^ -A(t)m - G(t)z) . (3) 

For clarity, over-hats have been used to denote that 
parameters in the dynamic inversion control law are 
estimated; look-up tables provide these matrices in real-time 
operation. The reader should be careful not to confuse the 
dynamic inversion parameters with real-time, adaptively 
estimated parameters that will be introduced in later 
sections. Finally, the subscript d denotes a desired value. 

To generate the desired values in (3), each of the three 
directional axes is treated independently. For the 
longitudinal and lateral axes, the pilot’s stick commands are 
interpreted as pitch and roll rate commands. These 
commands are then filtered to produce a tracking signal for a 
linear PI control law. The control signal is further 
augmented by a feedforward term from the filter states. The 
resulting control signals are the desired pitch and roll rates 
for the dynamic inversion in (3). 

For the directional axis, the pilot’s pedal inputs are not 
interpreted directly as yaw rate commands, but instead as 
sideslip commands. As a result, an additional dynamic 
inversion is employed to generate the necessary yaw rate 
command; this is similar to the work done in [4] in which a 
two stage slow and fast dynamic inversion architecture was 
proposed (in this work the fast dynamic inversion is similar 
to the dynamic inversion shown in (3)). More precisely, the 
pilot’s sideslip command is filtered to produce a tracking 
command for a proportional controller. This proportional 
control signal is augmented with a feedforward term from 
the filter states to generate a desired sideslip command. 
From this, a dynamic inversion is performed to generate a 
yaw rate command for the fast dynamic inversion in (3). The 
desired yaw rate command is then generated identically to 
the lateral and longitudinal axes, i.e. the desired yaw rate is 
generated from a linear PI controller augmented with a 
feedforward term. 

For the slow dynamic inversion, the following well- 
known relationship is used as a starting point for derivation: 

P = —^— [Dsin(/3) + F cos (p)-X T cos(a)sin(^) + 
mV 

mg (cos a sin sin 6 + cos ^ sin 0 cos 0 - (4) 

sin a sin p cos 0 cos 6 + psina-rcosa)] 



Fig. 1. High-level base-line control architecture. Here the 
dependency of the sideslip inversion on additional states (a, 6, (/>, p, 
and V t ) is not illustrated. 



Fig. 2. Presented here is the inner-loop fast dynamics controller. The 
D.I. block contains the equation presented in (3). 


The above expression is simplified to the more tractable 
relationship shown in (5). 

P « p sin(a) - r cos(a) + (glV t )cos(0)sin(0). (5) 

The yaw rate command is then determined from (5), as 
shown in (6). 

r = ~(P - p sin(a) - (g/V ) cos(0)sin(0))/cos(a) (6) 
com d t 

The complete architecture is presented in both Fig. 1 and 
Fig. 2. Specifically, Fig. 1 illustrates the outer loop control 
architecture while Fig. 2 shows the fast dynamic inversion 
using all three of the pitch, roll, and yaw rate commands. It 
is worth noting that the desired angular accelerations for this 
architecture (as diagramed in Fig. 2) are given below in (7). 

t 

cb d = K p (co f -co) + Kj f(cOf -co)dr +co f (7) 

o 

Here, the output of the command filter is expressly 
represented with a subscript f Moreover, it should be noted 
that both K p and Kj are constant diagonal, 3x3 matrices. 

As a final note on the general architecture, the pilot’s 
commanded pitch rate is augmented by the level turn- 
compensation term g sin 2 0/Fcos0. Qualitatively, this helps 
keep the nose of the aircraft up during bank and roll 
maneuvers. 

III. PMRAC Augmentation 

The PMRAC architecture adjusts a typical MRAC 
architecture by adding an additional, predictive training 
error. The following section will develop the traditional 
MRAC reference error as well as the PMRAC prediction 
error. It should be noted that the PMRAC augmentation is 
only applied to the inner most loop of this controller, i.e. it is 
applied to the fast dynamic inversion illustrated in Fig. 2. 

A. Preliminaries 

To begin defining the PMRAC adaptive augmentation, we 
manipulate the general system model of (2) by incorporating 
the true matrix values A, B , and G with their controller 
estimates. 

co = A co + Gz + Bu + A Aco + A Gz + ABu (8) 


and 


K T =B~ l G. 


(17) 


In (8), A denotes the difference between the true matrices 
and the estimated values used by the controller. To further 
develop the controller, the result in (8) is transformed into 
the more familiar adaptive controller A and B matrix form, 
as presented in (9). 


co = Aco + B(u + f) (9) 

The term / is here introduced for convenience and comprises 
the model uncertainty g. It should be further noted that B is 
assumed for a nominal aircraft and is thus taken as full-rank 
throughout the entirety of this paper. Comparing (8) and (9), 
the term /is then defined as 

f = B- 1 g + B- l Gz, (10) 

where 

g = (AAco + ABu + AGz ) . (11) 

In much of the adaptive control literature an additional term 
is added to (10) to approximate non-parametric uncertainty. 
This term is significant from a stability proof and analysis 
perspective, but is ignored here for the purposes of 
developing the correct control laws and PMRAC predictor 
system. 

To facilitate further development, the command filter in 
Fig. 2 is represented in state space form as in (12). 

<»f= A m CO f+ B m r ( 12 ) 

Here the reference input r is provided from the pilot 
commanded roll and pitch rates and the outer-loop side-slip 
controller. The filter is assumed to be a first order filter, as 
illustrated in Fig. 2. 

B. PMRAC Predictor System Development 

For the PMRAC development presented here, the 
predictor system is defined using the following state vector: 



( t ) 

r ( t ) 

T 

X = 

f co f dx 

co T f J codx 

1 T 

CO 


\0 ) 

* \0 ) 



Combining (3), (7), (12), and (13) as well as introducing a 
currently undefined adaptive control term, the control law is 
rewritten as 

u = K x x + r- z- B~ l u ad , (14) 

where 

K T x =[b- X K, B~\K p+ A m ) -B-'k, -B~\k p +A)], (15) 
K T r =B- l B m , (16) 


It should be observed that x G SR 12X1 , K x G SR 12X3 , K, G Sl 3X3 , 
K z G 9t 7X3 , u ad G 9t 3X1 , and u G 9t 3X1 . 

The system in (9), given the above definitions, is now 
easily written as 


x = Ax + B m r+B[K x x + K P r+B~ l (g-u ad )] (18) 
In (18), the following definitions have been introduced: 




0/00' 





0 A m 0 0 




A = 

0 0 0 / 

0 0 0 A 

? 

(19) 



r t i 

\ T 



B m = 

[o B T m 0 o] 

? 

(20) 

and 



r 



B =[ 

0 0 0 


(21) 

To complete the manipulation of the system dynamics, we 

can rewrite the system as 





x = A r x + B r r + B,(g-u ad ), 

(22) 

where 

A , 

. =(A + BK t x ) , 


(23) 


B r 

= (B m+ BK' r ), 


(24) 

and 






b] =[o o o /], bJ efft 3 * 12 . 

(25) 


It should be noted that the term g in (22) is time varying but 
that A r and B r are constant (time varying elements of these 
matrices end up cancelling to 0). 

The reference system for the control architecture is then 
the following: 


x r = A r x r +B r r . (26) 

Inspection of this system shows that the reference system is 
merely the true states following the filtered states in (12) 
(assuming identical initial conditions for both). Additionally, 
the system in (26) is a linear time-invariant system (LTI). 

For the PMRAC architecture, following the approach in 
[5], the predictor system is now the following: 


x = A (x- x)+A r x + B r r . 


(27) 


In (27), the matrix A p is G 91 12X12 and can be decomposed 



(37) 


into 4 sub-blocks that are each an element of 3t 6X6 , as shown 
in (28). 


A, (1,1) A, (1,2) 
A, (2,1) A, (2, 2)’ 


(28) 


For this work, we define A p as a positive multiplicative 
factor of A r , which implies that A p (l,2) is a 0 matrix. 


C. Adaptive Control Laws 

To complete the adaptive architecture, we begin by 
defining two error signals: 


g = W t T x + Wjr + W 3 T z + W 4 T u ad 

The adaptive control signal is now defined as below: 

u ad = Wfx + Wjr + W 3 T z + w 4 u ad (38) 

The adaptive control laws are given, once again following 
the work of [5], as in (39). 

W, = r.ProjfW, ,-x(e T r PB e + e T p P p B e )) 


e = x- x 
e = x- x 


(29) 


The respective error dynamics are then the following: 


W 2 = r 2 Proj(W 2 ,-r(e T r PB e +e T p P p B e )) 
W 3 = r 3 Proj(W 3 ,-u ad (e T r PB e+ e T p P p B e )) 
= r 4 Proj(W 4 ,-z{e T r PB e + e' p P p B e )) 


e = A r e + B l (u ad -g) 
e = A p e + B l (u ad -g) 


(30) 


Inspection of (30) reveals that the error terms associated 
with the filter-states are identically zero. More precisely, the 
reference model and predictor model reproduce the filter 
states in x exactly (assuming identical initial conditions and 
that A p (l,2) = 0 ), thus these error terms effectively cancel. 
The error dynamics can then be represented in a reduced 
form as 


where 


C = A e e r +B e( U ad~S ) 

*p = A pe^p +B e( U ad~§y 



(31) 


(32) 





(aj f -co) 7 


(33) 


0 / 



(34) 


A pe =A p { 2,2), (35) 

and 

B T e =[ 0 /]. (36) 

In (32), do is the fourth state of the predictor vector x. 
Moreover, in (33) it is important to note that the filter states 
appear because the reference model specified roll, pitch, and 
yaw rates are precisely the filter states when the reference 
model and true system have identical initial conditions, as 
was discussed in the previous section. 

The term g (as defined in (1 1)) may now be expressed as 
a function of x, u ad , r, z and a series of time varying weights. 


In order to be precise on matrix dimensions, we note that Wj 
G Si l2x \ W 2 G 3i 3X \ W 3 G 9i 3X3 , and W 4 G i)t 7X \ Moreover, 
as is consistent with most model adaptive control 
formulations, P and P p are the solutions to the Lyapunov 
equations A e T P + PA e = -Q and A JP p + B P A pe = ~Q P 

respectively, where P, P p , Q, and Q p are all positive definite, 
symmetric matrices with Q and Q p designer selected, tunable 
parameters. Finally, the ‘T” terms are the respective 
adaptive learning rates for each unknown parameter. It is 
here worth noting that the matrices in (34) and (35) are 
constant for the error dynamics, irrespective of the fact that 
the system is actually time varying. 

D. Theoretical Analysis 

To show the correctness of the development, three 
theoretical issues are addressed here: the model matching 
conditions of the general system in (22), the boundedness of 
the true and predicted system states, and the boundedness of 
the two error signals. As aforementioned, all uncertainty has 
been treated as parametric, thereby simplifying the 
presentation here. If non-parametric uncertainty were 
considered, the theoretical development would be further 
complicated. 

1) Model Matching Conditions 

The model matching conditions represent the necessary 
conditions for any control signal to fully cancel the system 
uncertainty g. If the model matching conditions are satisfied, 
it is possible for the system to perform exactly as specified 
in (26). In the current context, if (26) is realized the aircraft 
will behave as if the dynamic inversion is perfect; the 
system performance will be completely specified by the 
command filter in (12) and the choice of PI gains Kj and K p . 
The model matching conditions for this system can be 
determined by combining (10), (14), and (22) to obtain the 
proceeding result. 

k = Ax + B m r + B{K T x x + K T r r-B- l u ad ^ 

+ B~ l (A Ax + A Bu + A Gz)] 



In (40) the term AA is introduced to ensure dimensional 
consistency and is defined as below. 

AA = [0 0 0 A A] (41) 

Using the definition of each of the A terms, the expression in 
(40) can be reduced to 

x = Ax + Br + BX(BK T x + A A)x + BKj- Bu ad + (G - BK T z )z] . (42) 


weights. The use of the projection operator in the control 
laws then ensures that the A terms are bounded. More 
precisely, we must assume that 

w;(oeWi, w 2 (t)<=w 2 , w 3 (t)ew 3 , w 4 (t)(=w 4 Vt, (46) 

where W denotes a known compact set and 

IKi(0||^^,|K(0|h^,||W3(0||£^-IK(0|h^4 Vf. (47) 


Comparing (42) with (26) and temporarily assuming that the 
B matrix is known (this negates the need for the term W 4 r in 
(38)), we see that the conditions in (43) must hold in order 
for the uncertainty to be fully canceled. 

A+B t BK T x + B l AA = A r 

B m +B t BK T r = B (43) 

G-BK T = 0 

More precisely put, there must exist gains K x , K n and K z 
such that the expressions in (43) are satisfiable in order for 
the reference model in (34) to be fully-achievable. These 
conditions are relevant to any adaptive control technology in 
that they explicitly indicate that the control objectives (to 
track the reference model) may not be achievable if the 
system’s B matrix is not of full rank (or the uncertainty is 
not in the span of B). However, because the B matrix is not 
known, the adaptive control signal includes itself and there 
is a fixed-point problem. This is commensurate with the 
development presented in [1]. It should be stressed that 
making specific assumptions about the uncertainty in B can 
eliminate the fixed-point problem. 

2) Bounded Error Dynamics 

The error dynamics are already presented in (30-36). To 
show that the error dynamics are bounded, we use the 
candidate Lyapunov function presented in (44). 

V = e T r Pe r +e T p P p e p + trace (AW } T Tf AW, ) + 

trace(AWl T" 1 AW 2 ) + trace(AW 3 r T~ 1 AW 3 ) + . (44) 
trace(AW A TfAW 4 ) 

Here, A denotes the difference between the predicted and 
true weight value. Examining (2), (11), and (37), it is clear 
that the unknown weights must be time varying. Taking the 
derivative of (44) and substituting the error dynamics in 
(30), as well as the control laws in (39) then yields 

V = -e T r Qe r -e T p Q p e p - 2trace(AW \ T TfW \) 

- 2trace(AW 2 TfW 2 ) - 2trace(AW 3 T TfW 3 ) . (45) 

- 2trace(AW 4 T 4 l W 4 ) 

As a result of the time varying nature of the weights, the 
stability proof requires the existence of bounds on each of 
the weights as well as bounds on the derivatives of the 


If these bounds exist, the system errors (both the predicted 
and actual errors) are uniformly ultimately bounded. 
Moreover, because the reference system is bounded-input 
bounded-output stable, it follows that both the true system 
states and the predicted system states are bounded as well. It 
should be noted that the result in (45) is derived from the 
approach taken in [6] for an Lj adaptive controller. 

IV. Simulation Results 

To investigate the functionality of the system, we use an 
up-scaled version of NASA’s Generic Transport Model 
(GTM - developed at NASA Langley Research Center); this 
model is intended to represent a scaled version of a two 
engine commercial transport aircraft [7]. Using real 
aerodynamic data, the above controller is simulated in a full- 
nonlinear simulation environment. As an added caveat, data 
for the dynamic inversion is collected using the vortex- 
lattice code base VorView [8]; this ensures a separation 
between the true aircraft model and the dynamic inversion. 
For the purposes of this paper, we consider an aircraft 
performing a doublet maneuver in roll, pitch, and sideslip. In 
one scenario, we consider a nominal aircraft; in the second 
scenario, we consider an aircraft having lost control of the 
left elevator and its stab. Presented in Fig. 3 are the complete 
results of this comparison; Figure 4 presents an enlarged 
view of the pitch tracking. 



Fig. 3. Damage and undamaged aircraft performing doublet maneuver 
with baseline controller. 
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Fig. 4. Enlarged view of pitch tracking from Fig. 3. 


From the results presented in Fig. 3 and Fig. 4, it is now 
desired to arrest the pitch deviation at onset of the damage as 
well as return the undamaged tracking performance by using 
an adaptive PMRAC controller. For the purpose of 
comparison, results are also presented for an MRAC 
controller using identical adaptive learning rates to that of 
the PMRAC controller. Results are presented in Fig. 5-7. 

As can be seen from the results, the PMRAC augmented 
controller has demonstrably fewer oscillations than MRAC 
(at the given learning rates). Additionally, both controllers 
have reduced the initial pitch loss and recovered the pitch 
doublet tracking performance of the nominal aircraft. As is 
consistent with the literature, the control activity for 
PMRAC is also significantly reduced as compared to 
MRAC. It is worth noting that the MRAC performance may 
be improved with gain reduction and the inclusion of 
normalized learning rates. 

V. Conclusion 

This paper has presented a baseline nonlinear dynamic 
inversion controller with a PMRAC adaptive augmentation 
component. Simulation results have demonstrated that this 
architecture can improve tracking performance compared to 
both the un-augmented baseline controller and a traditional 
MRAC controller. 



Fig. 5. Results for GTM aircraft with adaptive control augmentation. 



Fig. 6. Enlarged view of roll tracking from Fig. 5. 



Fig. 7. Enlarged view of pitch tracking from Fig. 5. 
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